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CHAPTER  ~ 

INTRODUCTION 

In  many  physical  problems  of  interest,  with  sonar,  radar, 
and  .seismology  as  examples,  the  time  records  of  the  outputs 
of  an  array  '>?  sensors  are  observed  over  some  time  interval 
i'  used  t's  estimate  the  position  of  a  distant  noise  source, 
ypically,  the  outputs  (i.e.,  time  records)  of  the  sensors 
consist  of  amplitude  scaled  and  delayed  replicas  of  the 
waveform  from  the  distant  source,  corrupted  by  additive 
noises,  usually  local  in  origin.  When  the  amplitude  gradient 
cf  the  noise  from  the  distant  source  is  negligible  over  the 
array,  essentially  all  of  the  geometric  information,  e.g., 
rauige  and  bearing,  is  encoded  in  the  set  of  time  delays 
associated  with  the  propagation  across  the  array  of  the  signal 
wavefronts  from  the  distant  source.  This  thesis  treats  the 
topics  of  "ntering  and  signal  processing  co  optimize  the 
estimation  of  the  time  delays. 

The  distant  noise  source  and  the  additive  corrupting 
sensor  noises  are  al.\  assumed  to  be  independent  stationary 
Gaussian  random  processes.  Tlje  time  records  to  be  pro¬ 
cessed  are  long  compared  to  the  signal  and  noise  correlation 
t^imes  and  also -to  the  time  needed  for  a  signal  wavefront 
to  propagate  across  the  array.  These  assumptions  make  ^ 
analysis  based  on  a  Courier' representation  of  the  t line 
records  particularly 'ednvanient. 


1 


The  Cx’am6r-Rao  matrix  bound  (CRMB)  for  the  vector  delay 
estimate  is  developed  in  Chapter  II,  and  it  is  used  as  a 
benchmark,  relative  to  which  the  efficacies  of  two  estimation 
schemes  are  measured.  These  schemes  are  the  maximum  likeli¬ 
hood  precesKor  in  Chapter  III,  and  a  possibly  suboptimal 
multiple  correlator  processor  in  Chapter  IV.  Theory  dictates 
that  the  maximum  likelihood  processor  should  achieve  the 
matrix  bound,  and  this  is  shown.  If  the  number  of  sensors  is 
two  (2)  or  three  (3),  then  it  is  shown  that  the  multiple 
correlator  processor  with  optimally  filtered  inputs  also 
achieves  the  matrix  bound.  The  optimal  filters  are  discussed 
in  Chapter  IV.  Numerical  results  showing  the  effects  of 
suboptimal  filtering  are  presented  in  Chapter  V  for  the 
specific  case  where  the  signal  and  noise  spectra  are  band- 
limited  with  spectral  slopes  of  0,  -3,  or  -6  dB/octave. 

Although  there  have  been  many  applications  of  the 
Cram^r-Rao  estimation  theory  to  communication  theory,  they 
have  generally  been  to  problem.s  of  the  known  signal  in 
Gaussian  noise  type.  Among  the  investigators  who  have  applied 
the  Cram€r-Rao  estimation  theory  to  problems  in  which  the 
signals  and  noises  all  are  random  are  Levin  [1],  Harger  [2], 
and  MacDonald  and  Schulthelss  [33.  Levin  investigated  random 
process  power  spectrum  parameter  estimation.  Karger  treated 
the  problem  of  optimally  processing  data  of  unknown  focus 
in  electro-optical  systems,  MacDonald  and  Schulthelss 
considered  the  problem  of  optimally  estimating  the  bearing 
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f r  ,ra  a  sonar  array  to  a  distant  noise  source.  In  [3]  the 
authors  assume  that  the  signal  wavefronts  at  the  array  are 
planar,  and  the  problem  is  essentially  that  of  estimating 
a  scalar.  This  thesis  generalizes  the  problem  treated  by 
MacDonald  and  Schultheiss  to  the  extent  that  the  signal 
wavefronts  across  the  array  are  assumed  to  have  a  curvature, 
and  it  is  desired  to  determine  this  curvature,  and  hence 
the  rest  of  the  geometry,  by  measuring  the  vector  of  time 
delays  associated  with  the  propagation  of  the  curved  wave- 
fronts  across  the  array  of  sensors.  This  passive  multi¬ 
dimensional  localization  problem  appears  not  yet  to  have 
been  treated  in  the  literature. 

The  following  notation  is  used  in  this  thesis.  If  A 

is  a  matrix,  then  A“^  is  its  inverse.  A*  is  its  complex 
T 

conjugate,  A  is  its  transpose,  tr  A  is  its  trace,  and  det  A 
is  its  determinant.  If  matrices  A  and  B  are  positive 
definite,  then  A  >  B(A  >  B)  denotes  that  the  difference  A-B 
is  positive  definite  (nonnegative  definite).  A  square  matrix 
whose  elements  off  the  main  diagonal  are  all  zero  may  be 
written  as  diag(a^,  3^2,  a^),  where  is  the  i-th 

diagonal  element.  Vectors  are  column  vectors  unless  other¬ 
wise  specified.  1  denotes  a  vector  with  every  element  a 
one  (1).  0  is  a  matrix  of  zeros,  and  I  is  the  identity 

matrix.  <  •  >  denotes  the  expectation  operator,  and 'grad  f 
is  the  gradient  of  the  scalar  f.  The  gradient  of  a  vector 
is  the  matrix  in  which  the  i-th  row  is  the  gradient  of  the 
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1-th  component  of  the  vector.  The  Kronecker  delta  is  denoted 
In  the  usual  fashion  as  6^^ .  If  z  is  a  complex  number  its 
real  axA  imaginary  parts  are  denoted  by  Re  z  and  Im  z, 
respectively. 

The  following  conventions  regarding  integration  limits 
are  observed  in  this  thesis.  Integrals  of  the  form 
/_“  f  dx  are  written  as  /f  dx.  Integrals  of  the  form 

f  dt  are  written  as  /„  f  dt.  The  quantity  tOw  is  defined 
-T/2  ^  ^ 

in  Chapter  II.  Integrals  of  the  form  /  ^  f  dw  are  written 

as  /g  f  dw.  N 

The  symbols  MLE,  FIM,  CRMB,  and  HOT  are  abbreviations  for 
Maximum  Likelihood  Estimate,  Fisher  Information  Matrix, 
Cram^r-Rao  Matrix  Bound,  and  Higer  Order  Terms  (as  in  series 
expansions),  respectively.  Appendix  A  is  a  brief  introduction 
to  the  significance  of  the  FIM  and  the  CRMB. 

Appendix  B  is  an  introduction  to  complex  Gaussl'-.n 
random  vectors,  in  the  particular  sense  of  Goodmann  [^]. 

These  complex  Gaussian  random  vectors  are  pertinent  to 
the  study  of  the  properties  of  real  stationary  Gaussian 
vector  processes  with  long  observation  times. 
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CHAPTER  II 

THE  LIKELIHOOD  FUNCTION  AMD  THE 
FISHER  INFORMATION  MATRIX 
A.  BASIC  ASSUMPTIONS 

In  this  chapter  the  likelihood  function,  I.(D),  and  the 
Fisher  Information  Matrix  fPIM)  are  derived.  The  vector 
argument,  D,  of  the  likelihood  function  is  the  vector  of 
delays  to  be  measured.  The  following  Assumptions  are  made: 

1.  The  random  signal  and  each  of  the  additive  sensor 
noises  are  all  stationary  zero-mean  Gaussian  random  processes. 

2.  The  stationary  zero-mean  Gaussian  random  processes 
of  Assumption  1  are  all  independent. 

3.  The  observation  interval,  T,  is  large  compared 
to  the  correlation  times  of  all  the  stationary  zero-mean 
Gaussian  random  processes.  It  is  also  large  compax’ed  to 
the  time  needed  for  the  signal  wavefronts  to  transverse 
the  array. 
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B.  THE  LIKELIHOOD  FUNCTION,  L(D) 

Each  of  the  M  sensors  of  the  M-element  array  is  observed 

for  a  duration  of  T  seconds  over  the  time  interval. 

T  T 

-  ^  <  t  £  +  ^  .  The  output  of  the  i-th  sensor  is  Xj^(t). 

The  i-th  time-record  is  represented  by  Fourier  coefficients, 


Xi(tUj^)  =  x^(t)  exp{-jku)^t)dt. 


(2.1) 


where  -  2ir/T,  and  to.  =  kto^.  If  only  frequencies  up  to 
NUq  are  to  be  processed,  the  set  of  M  time  records  can  be 
represented  by  MN  complex  Fourier  coefficients.  These  MN 
coefficients  can  be  treated  as  a  single  entity  by  defining 
a  data  vector  X: 

X  ®  (Xj^ (u^ )  ,X2  ^  *  •  •  •  * ^  ^^2  ^  *  *  *  *  ^ *^N ^  ^  (2.2) 

By  Assumption  1,  each  of  the  components  of  the  data  vector 
is  a  zero-mean  complex  Gaussian  random  variable.  Let  the 
covariance  matrix,  R,  be  defined  by 


*  T 

R  =  <  X  X^  >. 


(2.3) 


The  elements  of  R  are  given  by 


^p^“m^\^“n^  ^T  ^T  exp{j  (Wj^^t-w^u)}  <  Xp(t  )Xq(u) .  > . 


(2.4) 


It  is  assumed  that  the  time  record  of  the  output  of 


the  p-th  sensor  is 


Xp(t>  =  s(t-Dp)  +  np(t). 


(2.5) 
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v/here  s(t)  is  the  random  signal  waveform,  D  is  the  signal 
delay  at  the  p-th  sensor  relative  to  the  delay  at  an  arbi¬ 
trary  reference  point,  and  n  (t)  is  the  independent 
additive  sensor  noise.  Let  R_(t)  and  R_(t)  denote  the 

5  p 

signal  and  noise  auto  correlation  functions,  respectively. 

By  Assumption  2  the  expectation  on  the  right  side  of 
Equation  2.^1  is 

<  X  (t)x  (u)  >  =  R  (t-u4D  -D  )  +  (2.6) 

p  q  s  q  P  pq  P 

The  autocorrelations  are  related  to  the  spectral  density 

functions  S(u)  and  Np(w)  by 

RoCi)  =  ir  /  S(“)  exp{jwT}dto  (2.7) 

O  II 

and 

Rp(T)  -  ^  f  Np(u))  exp{jwT}dw  .  (2.8) 

Inserting  Equations  2.6,  2.7,  and  2.8  into  Equation  2.4 

and  integrating  with  respect  to  t  and  u, 

2 

<  X*(Wjjj)Xq(u^)  >  =  /  do)  (S(u)+Np((»)))  exp{Jw(Dq-Dp) } 

rSln(ci)-l-27rm/T)T/2  sin(w+^2Tm/T)T^-, 
(tij+2Tfm/T)T/2  •  “(a)+2Trn/T)‘T/^ 

(2.9) 

By  assumption  3,  the  signal  and  noise  correlation  times 
as  well  as  D^-Dp  are  all  small  compared  to  T.  The  sine 
functions  [5]  in  Equation  2.9  are  orthogonal,  and  since  all 
the  functions  of  u  inside  the  integral  in  Equation  2.9 
vary  slowly  compared  to  the  sine  functions,  for  practical 


purposes 
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(2.10) 

The  real  and  imaginary  parts  of  X,  say  Xjjg  and  Xjj^j, 
respectively,  are  each  zero-mean  Gaussian  random  vectors. 

If  ‘  "  ^lA.  ^  If  ^  WlM  ^  ^ 

then  X  is  a  complex  Gaussian  random  vector,  in  the  sense  of 
Appendix  B.  This  is  true  (asymptotically  in  T).  Consider 

<  Re  Xp(<.^)  Re  X^Cu^)  >  -  <  Im  Xp(a,^)  Im  X^Cm^)  > 

“  /m  dt  du  <  Xp(t)Xq(u)  >  COS(Wjjjt+U)^t) 

»  /  dw  (S(w)+Np(w)6pq)  exp{ju(Dp-D^)} 

•  fij,  dt  ftp  du  exp{jw(t-u)}  cos(Wjj.t+w^u) 

*  ir  /  dw  (S((d)  +N_(u))6  )  exp{jw(D  -D  )} 

'  p  pq  p  q 


^2  rsin(w+a)jjj)T/2  sln(w-u^)T/2 


sin(a)-0}jjj)T/2  sin(a)+a)^)T/2  ” 


(2.11) 


Again  the  sine  functions  on  the  right  side  of  the  last 

part  of  Equation  2.11  are  orthogonal.  Since  the  integv^rs  m 

and  n  are  positive  only,  the  right  side  of  Equation  2.11 

is  essentially  zero.  Thus  the  real  and  imaginary  parts  of 

the  data  vector  X  have  the  same  covariance  matrix.  Almost 

T  T 

Identical  arguments  show  that  <  >  s  «  <  Xj^Xjj^  >  . 

It  follows  that  the  vector  X  is  a  complex  Gaussian  random 
vector  (see  Appendix  B)  and  the  density  function  for  the 
vector  is 
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p(X)  =  det  R)~^  expCX^R’^X*)  . 


(2.12) 


Through  the  remainder  of  this  thesis  the  asymptotic 
nature  of  the  results  derived,  particularly  Equation  2.12, 
will  be  assumed,  and  only  occasionally  emphasized  when  it 
seems  appropriate.  Equations  such  as  2,12  will  appear 
without  apology. 

Equation  2.10  shov/s  that  the  Fourier  coefficients 
for  different  frequencies  are  uncorrelated.  It  is  there¬ 
fore  convenient  to  define  the  following  vectors  and  matrices: 

X(k)  =  (2.13) 

V(k)  »  e‘J“kD2,  (2.14) 

N(k)  »  diag(N]^(a),^),  N2(Wj^),  ...»  Npj(u)j^))  (2.15) 


R(k)  =  N(k)  +  S(a)j^)  V»(k)  V^(k) 


(2.16) 


Using  Equations  2.13  through  2.16  the  density  function 
can  be  written  as 

p(X)  =  n  (/‘  det  R(k))”^  exp{-X^(k)R''^(k)X*(k)} 
k=l 

(2.17) 

N 

=  n  det  R(k)r^  exp{-  I  X^(k)R“^(k)X»(k)}. 


An  application  of  the  "matrix  inversion  lemma"  [6,  p.  133 


shows  that 


R"^(k)  =  N"^(k)  -  G(k)N”-^(k)V*(k)V^(k)N'‘-"(k),  (2.18) 

where  G(k)  is  defined  by 


G(k)  E  .  ,  'rx  +  ^  S(ajj^)Nj^(wj^)]“^  . 


(2.19) 
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The  vector  V(k)  is  conventionally  called  the  steering 
vector. 

In  the  remainder  of  the  text  it  will  be  convenient 
to  use  the  following  simplified  notation:  The  function  G(k) 
may  be  written  as  G(k),  cr  OCku^),  or  G(£0j^),  or  G(w)  if 
0)  =  kUg  is  understood,  or  simply  as  G  with  the  argument 
suppressed  if  the  argument  is  known.  This  notation  will 
be  used  for  all  of  the  frequency  dependent  scalars,  vectors, 
and  matrices.  In  addition  the  set 

B+  =  (ijl  an  Integer,  1  5  i  £  N) 

is  defined  in  order  ^hat  summations  over  the  range  of  the 

Fourier  frequencies,  ,  can  be  simply  written  as 

i»l 

Ib+,  whether  the  frequency  arguments  are  or  are  not  sup¬ 
pressed.  This  allows  summations  of  the  form  to 
be  written  as  and  always  understood  to  be  an  array 
sum. 


The  signal  delay  vector  D  is  defined  by 
D=  (Dj^,  D2,  ...,  . 

The  vector  D  is  the  vector  airgument  of  the  likelihood 
function,  L(D).  From  Equation  2.17, 

Ii(D)  =  [tt^  n  det  R]"^  exp{-  I  x'^R"V} 

B+  B+ 


(2.20) 


(2.21) 


It  is  convenient  at  this  point  to  examine  det  R(k),  and, 
in  particular,  to  show  that  it  is  not  a  function  of  D. 

Let  u  «  exp{Jw^}.  Then 
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+  Su*°2-B3.)  __  Su**^l'l 

(Ol-Dj)  s  +  ...  Su<“M 

•  •  • 

•  •  0 

0  0  0 

IsuC^I-Bm)  Su'°2-Bm)  ...  s  +  N 

Prom  Equation  2.22,  every  element  of  the  1-th  column  of 

Di 

R(k)  has  the  factor  u  ,  and  every  element  of  the  i-th 

-Di 

row  has  the  factor  u  .  Thus  if  R(k|i)  is  the  R(k)  matrix 
with  set  to  zero, 

det  R(k)  =  (u’'°i)^(u^i)^^  det  R(kii).  (2.23) 

In  view  of  Equation  2.23  and  the  definition  of  R(kli), 
det  R(k)  is  not  a  function  of  D^.  Further, 

det  R(lc)  =  det(N(k)  +  S(k)ll^  ). 


(2.2i|) 
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C.  THE  FISHER  INFORMATION  MATRIX 
The  CRMB  for  unbiased  estimators  of  the  vector 
argument  Y  is  the  inverse  of  the  FIM,  denoted  by  (FIM), 
where 

(FIM)  =  -  <  gradCgrad  In  L(Y))'^  >  .  (2.25) 

In  Equation  2.25  L(Y)  is  the  likelihood  function  for  Y, 
and  the  gradients  are  taken  with  respect  to  the  components 

m 

of  y.  The  matrix  grad (grad  In  L(Y))  is  a  matrix  of 
second  partial  derivatives.  From  Equations  2.l8,  2.19, 

2.21,  and  2.23, 

grad  (grad  L(Y))^  =  -  ][  grad  (grad  X^R“^X*)^ 

B+ 

=  X  G  grad  (grad  x'^N"V*v'^N"^X*)^. 

B-5- 

(2.26) 

Let  a  and  b  denote  arbitrary  elements  of  Y, 

The  corresponding  element  of  the  FIM  is  determined  by 

-  "  si  ^  -  I  °  "  A-i^(^vV)n-1x*  >  . 

(2.27) 

The  k,  m-th  element  of  the  matrix  V  V  is  exp{jw(Djjj-Dj^)}. 


Let 


vV  =(exp{Ja)(Djj^-Dj^)>), 


(2.28) 
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so  that 

^  dXP(XV‘’k”> 

«  (Jw)^  A  +  (jw)B  ,  (2.29) 

where  the  matrix  A  contains  all  and  only  those  terms  which 

Involve  differentiating  the  exponential  twice. 

Define  A  and  B  by 

A  =  N'^AN"^  (2.30) 

and 

B  =  N“^BN“^  (2.31) 

In  view  of  the  matrix  identity  X^QX*  =  tr  X*X^Q,  the 
expectation  in  Equation  2.27  is 
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<  VV)N-V  > 


=  <  tr  X*X^((Ja))^  A  +  (Ja))B)  > 
=  tr{RC(}«a)^  A  +  (Ja))B)]}  ( 


(2,32) 


=  tr{(N+SvV)((jw)^  A  +  (Ja))B)}. 

The  matrices  A  and  B  have  only  zeros  for  main 
diagonal  elements.  Thus  A  and  B  have  only  zeros  for 
main  diagonal  elements,  since  N“^  is  diagonal.  Again, 
the  main  diagonal  elements  of  NA  and  NB  are  all  zeros. 
Thus  tr(NA)  =  tr(K3)  =0,  and 


<  vV)S-lx*  > 


trfSvV((Ju)^  A  +  (2.33) 


The  k-th  main  diagonal  element  of  the  matrix 


V  V^B 


=  exp{Ja.(D,^-D„)} 


(2.3^) 


^  exp{ju(D^^j„)} 


'  j  kJ)  ■ 

km 

m 

It  follows  that 

tr(vVB)  .  n 

km 

=  0  , 


(2.35) 


(2.36) 
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-  V>  • 


C2.37) 


Thus  the  only  part  of  the  expectation  in  the  right  side  of 

Equation  2.27  that  is  not  zero  is  the  term  invo.l^'ing 
- 

tr  V*V^A. 

•  T~ 

The  i-th  main  diagonal  term  of  the  matrix  V  V  A  is 


(vVa)^^  =  I 


m 


3(D.-D^)  3(D.-D^)  exp{jw(D,-Dj} 

exp(J.(D 

1  m 


=  I 


3(Di-D^)  3(D^.D^)  1 


3a 


m 


3b  N.N 
i  m 


(2.38) 


Using  Equations  2.33«  2.36,  and  2.36  in  Equation  2.27, 
the  typical  element  of  the  FIM  becomes 


5l<A  =• 


«  «  3(D. -D  )  3(D.~D  ) 

-  V  vv  S  k  m _ km 

-  L  ^  ^  lL  n  h  3a  3b  . 

cx  1,_  K  m 


B+  km 


(2.39) 


Since  T  is  large  and  the  spectra  do  not  vary  appreciably 
over  intervals  of  width  w^.  Equation  2.39  can  be  written  as 


^  /  dw  - - - 5— 

^  B  (1  +  I  f) 

i  “i 

1 


ll 

km  ^ 


(2.il0) 


3a 


db 
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For  the  likelihood  function  L(Y),  the  elements  of  the 
PIM  as  developed  in  Equation  2.39  and  2.40  do  not  depend 
on  regarding  a  and  b  as  elements  of  the  delay  vector 
D.  The  Equations  2.39  and  2.40  are  correct  tso  long  as 
a  and  b  are  parameters,  for  example,  range  or  bearing, 
which  affect  the  likelihood  function  only  in  that  a  and  b 
are  arguments  of  the  delays. 

Specializing  Co  the  case  where  the  gradient  is  taken 
with  respect  to  the  components  of  D,  the  MxM  matrix  of 
second  partial  derivatives  is 

-  <  grad(grad  in  L(D))^  >  = 

I  - ®r-S-C<  I  (2.111) 

i  ^ 

This  is  not  the  PIM,  for  it  does  not  have  rank  M. 

This  is  due  to  the  fact  that  the  delays  appear  in  the 
expression  for  L(D)  only  in  difference  pairs.  One  of 
the  delays  is  arbitrary.  Arbitrarily  set  =  0  . 
Redefine  D  by 

D  =  (^2*  ^2*  ’  (2.^2) 

The  PIM  pertinent  to  the  estimation  of  the  M-1  vector  D 
defined  in  Equation  2.42  is  obtained  from  Equation  2.4l: 

(PIM)  =-<  grad(grad  In  LCD))"^  > 

=  I  20)2  - ^-g-C(  I  jf)  1  (2.43) 

B+  ^  I  H  i  ^ 
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In  Equation  2.^13  Np^  =  diag  (Np^,  ...»  i.  '’■ 

now  an  M-1  vector.  Or  equivalently,  the  FIM  is  the  M~3 
by  M-1  lower  right  partition  of  the  matrix  on  the  right 
side  of  Equation  2.40, 

An  unbiased  estimate  Y  of  the  vector  Y  is  efficient 
if  the  covariance  matrix  of  Y-Y  is  the  CRMB.  A  necessary 
and  sufficient  condition  for  the  existence  of  an  efficient 
estimate  is  that 

..  grad  In  L(Y)  =  M(Y)(Y-Y),  i2.hH) 

where  M(y)  is  a  matrix  which  depends  only  on  Y  and  not 
on  the  observations.  (See  Appendix  A.)  For  the  problem 
treated  in  this  paper 

grad  In  (L(D)  =  I  grad  In  X^R"^X*.  i2,U5) 

B+ 

Every  term  on  the  right  side  of  Equation  2.45  has  the 
observations,  that  is,  the  Fourier  coefficients,  as  factors. 
Thus  in  general,  efficient  estimates  do  not  exist. 

However,  it  is  shown  in  the  next  two  chapters  that  under 
certain  conditions  correlators  and  other  square-law 
processing  schemes  are  asymptotically  errioi'^nt,  in  the 
lim.it  of  large  T. 
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0.  THE  M=2  AMD  M=3  CASES 

If  the  number  of  sensors  is  M  =  2,  the  matrix  bound 
reduces  to  a  scalar  bound.  The  scalar  delay  to  be 
estimated  is  D,  the  delay  from  the  first  to  the  second 
sensor.  The  variance  of  any  unbiased  estimate,  D, 
of  D  satisfies 


var(D)  >  ^ 


4“^“  “  r;^TxJ 


I  -1 


(2.i}6) 


If  the  number  of  sensors  is  M  =  3.  and  the  vector  to 
be  estimated  is  defined  as  Y"  =  (YTjYg)*^  =  (02,0^-02)^, 


(PIM)  =  ~ 


Ki2  4.  K^3 


K 


13 


K13 

K23  +  K^3 


(2.il7) 


where 


K. , 
ij 


/ 

B 


s2/NiNj 


3 

1  +  I 


dw 


k=l 


(2.48) 


It  is  shown  in  Chapter  IV  that  for  M  -  3  a 
processing  scheme  using  three  properly  filtered  correlators 


achieves  the  matrix  bound. 
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CHAPTER  III 

THE  MAXIMUM  LIKELIHOOD  ESTIMATE 
A.  THE  ASYMPTOTIC  NATURE  OF  THE  MLE 
When  the  MLE  la  based  on  a  large  number  of  Independent 
samples  It  is  consistent,  asymptotically  normal,  and 
asmptotically  efficient  [73.  Since  this  thesis  treats  the 
case  in  which  the  observation  time  T  is  large  and  the 
correlation  times  are  small,  there  should  be,  in  some  sense, 
a  large  number  of  independent  samples.  It  should,  therefore, 
be  possible  to  examine  the  errors  in  the  MLE  and  shovj  that 
the  covariance  matrix  of  the  errors  is  in  fact  the  CRMB. 

In  this  chapter  this  is  demonstrated.  Since  the  results 
developed  in  this  chapter  are  independent  of  the  true 

I 

delays,  the  true  delays  are  assumed  to  be  all  equal  to  zero. 
Without  loss  of  generality,  the  signal  delay  at  the  first 
sensor  is  taken  to  be  zero,  and  the  delay  set  to  be 
measured  is  the  set  of  signal  delays  from  the  first  to  the 
remaining  M-1  sensors. 

The  MLE  for  the  delay  vector,  D,  satisfies 
grad  In  L(D)  =  0,  where  the  gradient  is  taken  with  respect 
to  the  M-1  unknown  delays.  The  vector  D  so  determined  is 
the  vector  of  measurement  errors.  The  vector  of  delay 
measurement  errors  is 

d'^  =  (Dg,  D3,  ...,  ,  (3.1) 
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and  the  error  In  the  steering  vector  Is 

=  (1,  exp{J(oD2},  ...»  exp{Ja)Dj^}),  (3.2) 

The  error  vectors  D  and  V  satisfy 

0  =  grad  In  L(D) 

=  grad  I 
B+ 

»  grad  I  GX'^N''^V*V^N‘*^X* 

B+ 

X 

=  grad  I  G  II  exp{Jw(D^-D^)}  (3.3) 

B->  i  n  ^ 

X  X* 

=  grad  I  G  II  j5ijj^(l+Jaj(Dj^-D^)+|(J(o)2(Dj^-D^)2+HOT) 

B+  in  ^  ” 

«  grad(A  +  BD  +  I  D^CD  +  HOT). 

In  Equation  3.3,  A  +  b'^D  +  |  D^CD  +  HOT  is  the  series 
expansion,  in  terras  of  the  vector  D,  for 

I  Q  n  (XiXj/N^N^)exp{jw(Djj->D^)}. 

B+  in 

Since  the  error  vector,  D,  is  assumed  to  be  small.  Equation 
3.3  leads  to  0  =  B  +  CD  +  HOT,  or  to  first  order, 

D  *  -0  B.  The  remainder  of  this  chapter  justifies  the 
following  (asymptotic)  Equations; 
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1)  <  > 

2)  <  b'^B* 

3)  C  =  <  C 

4)  <  D  >  = 

5)  <  DD^  > 


0 

(3.4) 

a  (FIM) 

(3.5) 

=  -(PIM) 

(3.6) 

1 

(3.7) 

'  (CRMB) 

(3.8) 
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B.  THE  VECTOR  B 


Prom  Equation  3^3, 
X,X* 


BD  -  I  0  n 

B+  in  ^  " 


»  I 

3+ 


•-j^i  n  n^i— ^ 

n  n 

=  C  I  JwGlV^CXXp'^  -  X*X^]Np^3D, 

B+ 

where  Xp  is  the  X  vector  with  the  first  component 
partitioned  away.  Thus 


(3.9) 


B  »  X  JuGl^N"^[XXp^  -  X*Xp]Np^  , 
B+ 


(3.10) 


Since,  at  each  frequency. 


R  =  <  X*X^  > 


=  N  +  S  1 

*T 

=  <  XX  -^  >  , 


(3.11) 


it  follows  that  <  B  >  =  0. 


T„« 


In  examining  the  terms  of  <  B  B  >  let 

x» 

“k  -  <  J  sj’  h;^  ■ 


Then 


^  “k  ^  ^  iTX 


<  uij  >  . 


(3.12) 


(3.13) 
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If  oj  and  w  are  distinct  frequencies, 
p  q 


<  (u^^(a)  )-uf(a)  ))(u  (w  )-u*(a)  ))*  > 
itp  Kp  n4  n4 


=  <  (u,  (u)  )-uf  (u  ))  >  <  (u  (t»)  )-u  (w  )*  > 

'  k'  p''  k'“  p-"  '  n  q  n  q 


=  0  , 


(3.1^1) 


since  the  Fourier  coefficients  at  different  frequencies  are 
independent.  The  element  in  the  (p-l)-th  row  and 
(q-l)-th  column  of  <  3*^6*  >  is,  from  Equations  3.9,  3.12, 
and  3.1^, 


<  8*^3*  >  ,  T  *  I  <  (u  -u*)(u  -u’)"  > 
p-l,q-l  4,  "  «  '“p  “q' 

3+ 

=»  I  (o^G^  <  u^u%u*u  -u*u*-u_u„  >  . 

pqpqpqpq 

«  x.x»x»x„  . 

p  <1  ll  Wp\ 

=  yy 

N.N.N  N„ 
ik  i  k  p  q 

«  » 

since  <  X^Xq  >  =  <  Xj^Xp  >  =  0. 


.2„2 


(3.15) 


<  u*u^  >  =  <  u  u*  >  . 

p  q  . 


Since  R  =  N4-S2J.  is  symmetric, 

X  u* 

P  q 

Further,  y  y  y»y* 

vv  <  *iVp  q  > 
“p“q  *  II 

,  rr  '^pl’^qk  *  Vql 

ik 

«  <  u*u*  >  . 

P  q 


,  (3.16) 


(3.17) 
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T  * 

Examining  the  elements  of  <  B  B  >  as  given  by 
Equation  3.22  shows  that  <  B^B*  >  is  the  M-1  by  M-1 
lower  right  partition  of  the  matrix 

p  »  J  I  ll'^N“^]  .  (3.23) 

(1  +  y  — )  ^1  ”” 

B+  ^  N/  1 

i  ^ 

But  this  lower  right  partition  of  P  is  the  FIM  for  this 
problem^  as  was  shown  in  Chapter  II. 

This  completes  the  proof  of  Equations  3.^  and  3.5. 
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THE  C  MATRIX 


From  Equation  3*3 


|d^od  .  I  i(i.)^a  II  - 

r>  »  -•  t-  1  K 


(3.2i|) 


The  terms  are  the  elements  of  the  sample 

covariance  matrix  based  on  T  seconds  of  data.  These 

sample  covariance  elements  do  not  converge,  even  If  T 

is  arbitrarily  long  [8].  However,  If  the  spectra  being 

estimated  are  sufficiently  smooth,  the  sample  covariances 

can  be  averaged  with  samples  from  nearby  frequencies 

to  provide  statistical  convergence.  By  hypothesis,  the 

spectra  In  the  problem  being  treated  are  sufficiently 

smooth.  The  summation,  J  ,  provides  a  smoothing  over 

B+ 

adjacent  frequencies  of  the  weighted  spectral  convariance 
estimates.  Thus  It  Is  assumed  that  C  »  <  C  >,  and  any 
statistical  variations  in  CD  are  of  second  order  relative 
to  the  variations  in  B.  From  Equation  3.3, 


^’’OD  =  I  |(Ja)^0  II  *  t>l-  2  Dj^D^).  (3.25) 

B+  Ik  ^  ^ 

The  rauige  of  the  Indices  1  and  k  In  the  summations  is 
from  1  to  M.  =  0.  Using  the  argument  preceding 


Equation  3.25, 


IT  IT 
^^CD  «  <  C  >  D 


(3.26) 


=  I  (j<o)^Q  n  rir^i  - 

B+  ik  ^ 
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N.N.  i  N.  ^  N. 

ik  ^  ^  i  ^  k  ^ 


■;^[‘-=4] 

.  [. .  i  W  .V 


(3.27) 


where  Np^  is  the  N“^  matrix  with  the  first  row  and  column 
partitioned  away.*  In  similar  manner 


ik  ^  ^ 


(3.28) 


where  Rp  is  the  appropriately  partitioned  R  matrix.  Thus 


<  C  >  D  =  X  (ju)^G{(l  -I*  I  j^)D^Np^D-D'^Np^RpNp^D> 

i  ^ 


=  D^{  I  (jw)'^GC(  I  j^)Np^-SNp^  ll^Np^3}D. 

j  1 


(3.29) 


It  follows  that 


<  0  >  =  I  2(Jb)2 - 2^ -  [(i  jp)  iiV^l 


In  fact 3 

-  <  C  >  *  <  8*^8*  >  =  (PIM)  , 
which  demonstrates  Equation  3.6. 


(3.31) 
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D.  THE  D  VECTOR 

Since  the  measurement  errors  are  small,  the  D  vector 

T  T 

is  essentially  determined  by  0  *  B  +  C  D.  Since 
C  «  <  C  >  with  negligible  error,  and  since  <  C  >  is 
symmetric,  it  follows  that,  asymptotically, 

D  »  -  <  c  >“V  ,  (3.32) 

<D>=-<C>“^<B^>»0,  (3.33) 

and  the  covariance  matrix  for  D  is 

*  T  -IT**  -1 

<  D^D-^  >  «  <  C  >  •*•  <  B-^B  >  <  C  > 

=  <  C  (PIM)  <  C  >“^ 

»  (FIM)**^ 

«  (CRMB)  .  (3.34) 

Thus  the  MLE  for  the  delay  vector  is  efficient. 

It  is  also  unbiased.  This  demonstrates  Equations  3.7 
and  3.8. 
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CHAPTER  IV 

CORRELATOR  DELAY  MEASUREMENT  SYSTEI/IS 
In  this  chapter  a  delay  measurement  system  using 
correlator  techniques  is  studied.  The  covariance  matrix 
for  the  delay  er'  ates  is  derived.  The  use  of  filters 
to  optimize  the  system  is  studied.  For  the  M  =  2  and 
M  ®  3  cases,  the  correlator  based  niisasurement  system  with 
certain  filters  is  shown  to  achieve  the  CRMB.  A  conjecture 
is  made  regarding  the  optimum  filters  when  M  >  3* 
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A.  CORRELATOR  MEASUREI4ENT  COVARIANCES 
The  output  of  an  Ideal  correlator  with  T  seconds 
of  integration,  and  with  the  i-th  and  j-th  sensors  for 
inputs,  is 

ZijCT^:  =  i  /^x^(t)xj(t  -  Tj)  dt  .  (4.1) 

By  the  assiunptions  in  Chapter  II,  the  additive  sensor  noises 
are  independent.  It  follows  then,  that 

‘  ^  -  '>3  -■'j>  > 

and  that  the  peak  of  the  function  (Tj )  determines  an 

unbiased  estimate  for  ^ j  * 

Let  Xj(t),  Xj(t)  and  Xj^(t),  x^(t)  be  two  pairs  of 

sensor  outputs.  These  pairs  are  correlated  to  determine 

and  z  (T  ).  The  time  records  x.  (t)  and  x.(t)  are  assumed  to 
kn  n  1  j 

be  the  outputs  from  two  distinct  sensors.  The  same  applies 
to  X.  (t)  and  x  (t).  However,  it  is  not  assumed  that  all 

iC  il 

four  Inputs  are  distinct.  With  the  Integers  i,  j,  k,  and  n 
fixed,  it  is  convenient,  for  this  section,  to  relabel 
x^(t)  as  x^(t),  Xj(t)  as  XgCt),  Xj^(t)  as  x^Ct),  and  x^(t) 
as  X|j(t).  Thus,  the  Xj^(t)  of  this  section  may  or  may  not 
be  the  x^^Ct)  of  Chapter  II.  And  the  x^Ct)  of  this  section 
may  be,  in  fact,  the  same  as  x-(t)  or  Xj^(t). 

The  correlators  have  as  outputs  Z2i|(T2|). 

Define  ^34^'^!!^ 
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^311  ^'^4^  =  Z3i,('ri,)  -  <  Z3h(T^)  >  (^-3) 

Let  T2  and  T^  be  those  values  of  T2  and  T^,  respectively, 
at  which  the  correlogram  peaks  are  located.  Let  62  and  ejj 
be  defined  by: 


®2  *^2  ^1  “  ^2 


(il.il) 


and  let  e2  and  e|j  be,  respectively,  the  values  of  ep  and  62^ 

A  A  A  A 

When  T.,  =  T2  and  Then  e2  and  e^j  are  the 

measurement  errors.  The  correlator  outputs  are  with 
probability  1  everywhere  differentiable  functions  of  T2 

A  A 

and  Tij.  T2  and  T^j  are  determined  by: 


'12 


(T2)  =  0 


A 

at  T2  =  T2 


dT|2  ^34^’^4^  °  at  T22  =  T22 

Letting  the  derivatives  be  denoted  by  primes,  and  using 
the  definitions.  Equations  (M.5)  become: 


0  =  R^(e2,)  +  y3n(T4) 


(^.6) 
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The  exTors  are  assumed  small*  Assuming  that 
is  differentiable  at  e  =  0,  It  has  the  series  expansion 

R  (e)  =  R  (0)  +  I  R"(0)e^  +  HOT  .  (4.7) 

S  S  s 


Using  Equation  4.7  and  neglecting  the  HOT,  Equations  4.6 
yield: 

®4  "  "  y3l}(Tl|)  » 


whence 


<  >  =  (r"(0))"2  <  yi2(T2>5'3l|^^4'  ^ 

Next,  the  expectation  on  the  right  side  of  Equation  4.9 
is  evaluated.  The  evaluation  is  accomplished  by  using 
Fourier  coefficients  and  Fourier  series.  The  coefficients 
are  as  defined  and  used  in  Chapter  II,  and  the  summations 
that  follow  are  understood  to  range  over  the  (positive  and 
negative )  integers . 


x^^Ct)  =  I  I  X^(k)  exp{Jka)^t}  . 
k 

X2(t  -  ^2)  =  I  I  X2(k)  exp{-.jk(i)Q(t-T2)>  . 
k 


(4.10) 


(4.11) 
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yi2(T2)  =  I  x^(t)x2(t-T2)  dt  -  <  x^(t )x2(t~T2)  > 


=  II  /  dt  exp{Ja)  t(i-k)})X,(i)xJ(k)  exp{jkWQT2} 

ik  T 


-  <  x^(t )x2(t-T2)  > 


“2  I  X^(k)X2(k)  exp{jk(iO^T2} 
^  k 


<  x^(t)x2(t-T2)  >  , 


since 


/  dt  exp{ jw^tCi-k) }  =  T  «ik- 


(4.13) 


^  ~2  ^  jkwQX^(k)X2(k)  exp{jka)QT2} 


T  k 


^  <  X3^(t)x2(t-T2)  >  . 


(4.14) 


From  Equation  4,l4,  <  >  =  0,  since  both  terms  on 

the  right  side  of  Equation  4.14  are,  after  the  expectation 

is  taken,  equal  to  +  Rg(D2  “  ^2  ~  "2^* 

An  expression  for  can  be  written  by  replacing 

1  and  2  where  they  occur  in  Equation  ^*.l4  by  3  and  4, 
respectively.  Thus 
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^“12 ^^2  ^^34 ^  “ 

-a  n  C(J«^nk)(J(o  n)  <  X  (k)X*(k)X.(n)x!!(n)  > 
kn  ®  ^  :s  *4 

•  exp{Ja)Q(kT2  +  nT^^)}] 

=  “a  II  (JWok)(Jaj^h)C  <  X2^(k)X2(n)  >  <  X2(k)xJ(n)  > 

^  kn 

+  <  X3^(k)xJ(n)  >  <  X2(n)X2(k)  >  ]  . 

(4.15) 

The  last  part  of  Equation  4.15  follows  from  the  argument 
made  immediately  after  Equation  4.l4. 

The  Fourier  coefficients  at  positive  and  negative 
frequencies  are  related  in  such  a  way  that 

<  Xj^(k)X3(n)  >  =  <  Xj(k)X*(-n)  > 

<  X^(k)X*(k)  >  if  n  =  -k, 

0  if  n  7^  -k.  (4.16) 

■ 

Thus , 

^  ^12  ^*^2  ^^34  ^^4  ^  ^ 

=  “a  I  (n%)^C  <  Xj^X*  >  <  X^X^^  >  exp{jna)^(T2-Tj^)} 

^  n 

-  <  X^X*  >  <  X-X*  >  exp{jnUj^(T2+T;^}]  . 


(4.17) 
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Now  replace  1,  2,  3j  and  ^  everywhere  in  Equation  ^.17 
by  h,  i,  k,  and  1,  respectivelv.  Note  that 


<  X^X^  >  =  T(N^5j^^  +  S  exp{ja)(D^-Dj^)}) 


=  T(S  +  exp{jo)(D^-Dj^)}  . 


iH.lB) 


Hence  Equation  ^.17  becomes 


n 


•[(S  +  ^i^il^  exp{JnwQA} 


•(S  +  Nj.6j^^)(S  +  exp{jna)^B}], 

(^.19) 


where 


A  =  T^  -  +  Dj^  +  D^  -  Dt  , 


(il.20) 


and 


B  =  T.  +  Tt  -  D,  +  Dt  -  D,  +  . 

1  1  h  i  k  i 


(4.21) 


Next  let  T^  =  T^  and  =  T^  .  Then  A  and  B  become 


A  =  ei  -  ei 


(4,22) 


and 


E  =  64  +  e. 


(4.23) 


'i  ■  "1  » 

respectively.  Neglecting  the  HOT  in  the  errors.  Equation 
4.9  can  be  written  as 


<  eiei>  =  (R^0)r2  5^  f  du. 


U) 


(4.24) 


•f(S  +  NjjShk^'^  ■*■  +  Nj^^hl^CS+Vii^))- 


Ml 


1 


'M 


si 


M 


% 

9si 


M 


% 


m 


i 


I 


as 

I 
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The  elements  of  the-  covariance  matrix  of  the  correlator 
delay  estimates  are  determined  by  Equation  4.2i|. 


36 
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B.  FILTERS  FOR  CORRELATOR  SYSTEMS  (M=2  AND  3) 

In  this  section  the  use  of  filters  to  optimize  the 
corelator  delay  measurements  is  studied.  For  the  M  =  2 
and  M  =  3  cases  the  optimal  filters  are  displayed.  For 
M  >  3>  the  optimal  filters  are  conjectured. 

Consider  first  the  M  =  2  case.  There  is  only  one 
delay  to  measure.  Let  it  be  D  =  Dg  -  D^.  The  estimate 

A 

for  D  is  denoted  D.  From  Equation  4.24  and  2.46, 


var(D) 


5)  _  2ir  /b  ^  ^%^2  ^  ^ 

(  /  w^S  do))^ 


if[/3 


sVn^n^ 

,  .  s  ,  "s 


“r . 


(4.25) 


If  filters  Fj^(w)  and  F2(w)  are  to  be  used  on  the  outputs 
of  the  sensors  1  and  2,  respectively,  the  filters  must 
have  Identical  phase  responses.  If  they  do  not,  then  the 
filtered  signals  will  have  different  delays  at  different 
frequencies,  and  this  could  bias  the  delay  estimate. 

Since  the  phase  responses  are  identical,  the  filters  may  be 
assumed  to  be  identical  filters.  Let  F  =  F^  =  F2  . 

The  filtered' signal  spectrum  is  S(a))  |F(u)  |  ^.  Similarly, 
the  filtered  noise  spectra  are  N^Cw)  |F(a))  |  and  N2(w)  iFCu)  |  . 
Thus  Equation  4.25  is  readily  modified  to  account  for  the 
filtering.  Equation  4.25  becomes 
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var(D)  =  ^ 


(  f  (o^lFpS  dw  )' 
B 


S^/N,N, 


^  il  r  r  m2  "  /»1"2  -1.1 

-  [^b“  TTT^  j 


"l  «2 


(4.26) 


If  the  filter  |P|  is  defined  by 


lF(w) 


“l  ^2 


(4.27) 


Equation  4.26  becomes  an  equality.  The  measurement 
variance  is  now  the  Cramer-Rao  bound.  Thus,  although 
efficient  estimates  were  shown  in  Chapter  II  not  to  exist 
for  this  case,  none  the  less,  a  correlator  system  with 
filters  determined  by  Equation  4.27  provides  an  asymptotically 
efficient  delay  estimate. 

For  the  M  >  2  case,  there  are  M(M-l)/2  sensor  pairs 
to  be  correlated.  Let  e^j  be  the  error  in  the  delay 
estimate  based  on  the  correlation, 

^  ^T  »  (4.28) 

P  P 

where  (t)  and  Xj  (t)  are  the  filtered  x^(t)  and  Xj(t)  ' 
processes,  respectively.  From  Equation  4.24 

-  *  2.  /b  '5“ 

"  “  T  ^  d..  /  a,.  ’ 

B  ij  B 

(4.29) 
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where 

G(lj;kl)  =  C(S+N^6^j^)(S+Nj6^.^)  -  (S+N^6^^)  (S+Nj  ]  . 

(il.30) 

In  Equation  4.29,  is  the  filter  used  on  the 

and  Xj  waveforms,  which  are  the  inputs  to  the  correlator 
that  computes  (T^ ) . 

Let  the  M(M-l)/2  errors  be  displayed  as  a  coliunn 
vector  e,  and  let  P  be  the  covariance  matrix  for  the  errors 

Pg  =  <  ee*^  >  .  (4.31) 


The  main  diagonal  terms  of  P^  are  the  individual 
variances  of  the  M(M-l)/2  delay  measurements  from  the 
M(M-l)/2  correlators.  Prom  the  discussion  pertaining  to 
Equation  4.26,  the  filters 


Ij 


s/Vi. 


1  +  -5-+  -s. 


(4.32) 


will  minimize  the  main  diagonal  terms  of  P^.  It  is 
tempting  to  conclude  that  these  filters  will  then  be 
the  optimal  filters.  But  this  is  not  true  in  view  of  what 
follows. 

For  the  problem  being  considered,  a  vector  (of  delays) 
is  to  be  optimally  estimated.  The  observations  (from  the 
correlators)  are  a  linear  combination  of  the  delays 
to  be  estimated,  corrupted  by  additive  zero  mean  noise. 
Therefore,  the  vector  to  be  estimated,  D,  is  linearly 
related  to  the  observations,  T,  by 
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T  =  AD  +  a  . 


(4.33) 


The  covariance  matrix  for  e,  is  given  in  Equation  4.31. 
It  is  known  that  the  optimal  linear  estimate  for  the  vector 
D  is  the  Gauss-Markov  estimate  [9], 


D  =  [a'^p“^a3“Vp"^t  . 

6  Q 


(4.34) 


The  Gauss-Markov  estimate  is  the  minimum  variance 
linear  unbiased  estimate  for  estimating  a  vector,  given 
lineeir  observations  corrupted  by  additive  zero-mean  noise. 
The  covariance  matrix  for  the  Gauss-Markov  estimate  is 


<  (D-D)(D-D)^>  =  [A^P^^A]-^  . 


(4.35) 


Without  loss  of  generality,  it  is  convenient  to  fix 
the  order  of  the  elements,  T. ,,  of  T  by  c--finlng 


T  -  (T^2>  '^i3»  •••»  "^(m-DM^  * 


(4.36) 


In  addition  define  the  following  scalars,  vectors,  and 
matrices : 


e  -  (e^2»  ®i3»  ®(M-1)M^ 


(4.37) 


P  =  diag  (iF^gl^  |Fi3l^  |F(m-1)mI^^ 

K  =  /  w^SP  do) 

B 


=  diag  (K^2»  ^13>  •••»  ^(M-l)M^ 


G  =  CG(iJ;kl)]  . 


(4.39) 

(4.40) 
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The  matrix  G  has  for  its  elements  the  terms  G(ij;kl), 
positioned  according  to  the  scheme  determined  by 
the  order  of  the  subscripts  in  Equation  4.36.  Then  from 
Equations  4.29,  4.31,  and  4.35  through  4.40, 

.-1 


Pg  =  "Y  K“^(  /  PGP  dco)K‘ 
B 


(4.41) 


and 


<  (D-D)(D-D)'’^  >"^  =  X  I  £j2pgp  (iu)“lKA  .  (4.42) 

B 


Consider  the  M  =  3  case.  The  matrix  G  is 


G  = 


N^N2  +  S(N^+N2)  “‘’I 


SN, 


-SN. 


SN, 


-SN- 


+  S(N^+N2) 
SN. 


SN. 


’'2  ‘^”3 

Let  the  filters  be  specified  by 

I  TH  I  2 


N2N2+  S(N2+N^ 

(4.43) 


IJ 


1  + 


N,. 


(4.44) 


k=l 


Then 


PGP  = 


1  +  I  iT 


dlag( 


V2’  ^1^^*  ^2^3' 


SVN1N2N3 

"  (1  M 


1-11 
-1  1  -1 
1-11 


(4.45) 
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Since 


1-11 
-1  1  -1 


1-1  1 


11  -1  1 


(4.1»6) 


define  =  (1  -1  1),  so  that 


Pg  K-^CK  +  bww'^DK”^  , 


(4.47) 


where 


5  S^/NtN^N, 

-  /  - --^  --5  du 

B  Cl  +  I  f 


(4.48) 


By  the  matrix  inverse  lemma  [7] 


P*”^  ss  TK  —  I 


(4.49) 


=  ^  -  vnv'^] 


(4.50) 


with 


1  T 

d  =  i  +  w^Kw  . 


(4.51) 


It  is  now  necessary  to  specify  the  vector  D  and  the 
matrix  A.  Let  D  be  defined  as 


so  that 


D  =  (Dg,  -  Dg)  , 


1  0 


(4.52) 


1  1 


0  1 


(4.53) 
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Then 


A  A 


<  (D-DXD-D)^>'^= 


JL  a'T 
2vd  " 


dK^2-l 


1 

-1 


dKis-l 


-1 

1 


dK23-lj 


A 


T 

2Trd 


•d(K,2+K,3) 


L  dK 


13 


d(K^^  :i23)J 


For  the  specific  choice  of  filters, 

P  S^/W  N. 

K  =J  oj^  - dto  .  (4,55) 

B  1  +  I  if- 
Nu 

k  ^ 

The  FIM  matrix  for  M  =  3  with  the  same  choice  of 
delay  vector  to  be  estimated  was  given  in  Equations  2.4? 
and  2.48.  Those  Equations  are  the  same  as  Equations  4.54 
and  4.55.  That  is,  the  Gauss-Markov  estimate  obtained 
from  the  three  correlator  processing  scheme,  with  the 
inputs  to  the  correlators  filtered  according  to 
Equation  4.44,  achieves  the  CRMB. 

Thus,  for  M  =  2  and  M  =  3»  the  correlator  processing 
scheme  can  provide  the  best  possible  estimate,  if  the 
proper  filters  are  used. 
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CHAPTER  V 

SUBOPTir-IAL  FILTERING 

In  the  preceding  chapters  optimum  filtering  schemes 
for  correlator  delay  measurement  systems  were  discussed. 

An  interesting  question  is  how  sensitive  are  the  measure¬ 
ment  errors  to  changes  in  the  filter  design  from  the  opti¬ 
mum.  These  design  changes  may  be  deliberate  or  inadvertent, 
An  Instance  of  the  former  is  when  the  prior  knowledge  of 
the  signal  spectrum  is  limited.  In  such  a  case  the 
designer  may  choose  to  design  for  an  assumed  worst  case. 

Or  perhaps  he  may  choose  to  simply  whiten  the  input  noise 
Or  perhaps  the  correlator  system  is  also  to  be  used  for 
detection.  The  designer  may  choose  to  use  the  Eckart 
filters,  since  they  are  the  optimal  filters  for  correlator 
detectors  [10]. 

In  this  chapter,  the  degradation  in  the  correlator 
estimator  performance  is  studied  for  W  =  2.  It  is  assumed 
that  both  noises  have  the  same  spectrum.  The  signal  and 
noise  spectra,  S(u)  and  N(w),  respectively,  are  both  taken 
to  be  bandlimlted  with  constant  slopes  of  0,  -3,  or  -6 


dB/octave . 


S(w)  =  <1 


|'2A  ,  1  £  w/u^  £  1  +  W 


,  elsewhere  . 


(5.1) 


NOLTR  72-120 


N(w)  = 


2B  (a)/w^)“  ,  1  ^  w/u)^  5.  1 


,  elsewhere  . 


(5.2) 


In  Equations  5.1  and  5.2  S(u)  and  N(w)  are  the  one 
sided  spectra.  W  is  a  bandwidth  variable.  For  W  =  1,  3, 
7,  and  15,  the  bandwidths  of  the  spectra  are  1,  2,  3,  and 
4  octaves,  respectively.  The  spectrum  slope  is  determined 
by  a  and  b,  each  of  which  will  be  equal  to  0,  1,  or  2. 

The  suboptlmal  filters  will  be  defined  by: 


I''eckI  =■  S''" 

|F„hI®  =  I/N 
|2  . 


1  +  2| 


(5.3) 


(5.^) 

(5.5) 


(5.6) 


^ECK  Eckart  filter.  is  the  filter  that  whitens 

the  input  noise.  Fqpt  optimal  filter  for  delay 

estimation.  Notice  that  for  large  SNRs  (Signal  to  Noise 
Ratios)  the  optimal  filtor  essentially  whitens  the  noise. 
(The  system  performance  is  unaffected  by  filter  gain  con-  . 
stants.)  For  small  SNRs  the  optimal  filter  is  essentially 
an  Eckart  filter. 

In  what  follows  the  system  dependence  on  the  input 
SNR  is  studied  by  plotting  curves  for  five  SNRs  determined 
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(|)  -  2“^  ,  k  =  C,  1,  2,  3,  ii.  (5.7) 

T  ordinates  of  the  curves  will  be  the  measurement  stand¬ 
ard  deviations  for  the  optimally  filtered  case,  or  the 
degradation  in  dB  when  suboptimul  filters  are  used.  The 
variable  W  is  used  for  the  abscissa. 

Whelchel  [11]  used  the  spectra  of  (5.1)  and  (5.2)  in 
an  analysis  of  the  effects  of  suboptimal  filtering  on 
correlator  performance.  Whelchel  v/as  primarily  interested 
in  signal  detection,  and  used  the  output  SNR  of  the  correlator 
as  a  measure  of  detection  capability.  The  study  of  sub- 
optimally  filtered  correlator  estimators  in  this  chapter 
in  part  parallels  Whelchel’ s  study  of  suboptimally  filtered 
correlator  detectors. 

2  2  ? 

Denote  the  measurement  variances  by  ^ECK’ 

2 

and  Dqpijij  to  correspond  to  the  filters  Fj^p,  F£qt^>  and 

Fqpt* 

Let  d  =  a  -  b.  When  the  spectra  of  (5.1)  and  (5.2) 
are  used  with  the  filters  defined  by  (5.3),  (5.^),  (5.5), 
and  (5.6),  and  the  estimation  variance  is  calculated  using 
(^1.26),  the  result  in  all  cases  except  one  depends  on  d, 
and  not  specifically  on  a  and  b.  Only  when  the  filters 
Fyjj  are  used  does  the  corresponding  variance  depend 
specifically  on  both  a  and  b. 

Graphs  showing  hov;  the  standard  deviation  of  the 
optimally  filtered  delay  estimate  depends  on  the  parameters 
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k  and  W  are  given  in  Figures  1,  2,  and  3  for  d  =  0,  1,  and 
2,  respectively.  The  curves  are  normalized  by  the  factor 
/( (0^  T),  so  that  /(w^  T)-  Dqp^(W)  Is  dimensionless.  If 
for  example,  and  T  are  chosen  so  that  /(to^  l!)  =  lO^sec”^, 
then  the  abscissas  of  these  three  figures  read  directly  in 
milliseconds.  For  d  «=  0  and  d  =  1  Dqp^(W)  can  be  made 
arbitrarily  small  by  letting  W  be  sufficiently  large.  This 
is  not  true  for  d  =  2.  The  curves  of  Figure  1  also  apply 
to  Dyy  and  since  for  d  «  0,  S/N  is  simply  a  constant. 

They  also  apply  to  Dj^p  when  the  noise  and  signal  spectra 
are  both  flat  (a  =  b  «  C). 

The  asymptotic  nature  of  the  curves  for  small  values 
of  W  may  not  truly  represent  the*  behavior  of  the  system 
measurement  error.  This  is  because  the  derivation  leading 
to  (^.25)  assumes  a  sufficiently  lai'ge  time-bandwidth 
product  to  yield  measurements  with  small  errors. 

Figures  ^  through  7  show  the  processor  performance 
(in  dB,  relative  to  the  optimal)  when  suboptimum  filters 
Dyjj  and  are  used.  In  these  curves  coth  the  optimum 

and  the  suboptimum  systems  are  presumed  to  process  the  same 
band  of  frequencies,  determined  by  the  argument  W,  For 
example,  in  Figure  ^  the  system  degradation  is  given  by 

dB  loss  »  10  log^^p  (W)/D^p^(W)),  (5.6) 

for  d»a-b“*l.  As  indicated  by  Figures  5  and  6,  for  the 


NOLTR  72-120 


assumed  spectra,  the  Eckart  filtered  system  sustains 
a  processing  loss  relative  to  the  optimum  of  at  most  about 
0.5  dB.  The  processing  loss  decreases  as  the  input  SNR 
decreases,  as  expected. 

Figures  8  and  9  depict  the  system  performance  if  only 
flat  band-pass  filters,  Fjjp»  az*e  used.  For  these  figures 
d  =  2,  with  a  =  2  and  b  =  0  in  Figure  8,  and  a  =  3  and 
b  =  1  in  Figure  9.  In  these  figures  the  system  degradation 
is  measured  relative  to  the  infinite  bandwidth  (W  =  ») 
optimally  filtered  system.  Thus  the 


dB  loss  =  10  logj^Q  (D^p(W)/D2p^(»))  ,  (5.9) 

in  Figures  8  and  $.  Note  that  Figure  8  also  gives  the  dB 
loss  of  Dyjj(W)  relative  to  DQpp(«)  for  d  »  2.  This  is 
because  b  =  0  in  Figure  8,  and  the  noise  is  flat.  In  both 
figures  the  processing  loss  at  first  decreases  to  a  minimum, 
and  then  as  too  much  high  frequency  noise  is  processed  the 
variance  then  increases.  Since  the  signal  spectrum  is  fall¬ 
ing  off  at  6  dB/octave  faster  than  the  noise  spectrum,  the 
processor  will  behave  ever  more  poorly  as  the  processor 
bandwidth  is  made  larger  and  larger.  It  is  interesting 
that  a  W  of  3  or  corresponding  to  about  a  two  octave 
processing  band  yields  a  loss  of  only  3  or  4  dB. 


NORMALIZED  STANDARD  DEVIATION,  OPTIMUM,  d=l 


FIG.  8  SYSTEM  DEGRADATION  USING  jF^pl  .  d=2,  a=2,  b=0. 
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APPENDIX  A 


THE  CRAMER-RAO  MA'TRIX  BOUND 


The  Cramfr-Rac  matrix  bound  (CRMB)  is  the  generaliza¬ 
tion  to  the  vector  estimation  case  of  the  scalar  Cramer-Rao 
bound.  The  CRi'ffi  xs  useful  for  the  same  reason  that  the 
scalar  bound  is  useful — it  is  a  bound  for  all  estimators, 
and  can  be  calculated  from  a  knowledge  of  only  the 
probability  density  function  without  specifying  an 
estimator.  It  is  easily  derived,  as  follows: 

If  u,  V,  and  w  are  vectors,  and  H  is  a  matrix,  and 
if  only  V  and  w  are  random,  consider  the  quadratic  form 

0  <  u*^(v-Hw)  (v-Hw)'^u 


ITI  m  T*  T' 

=  u'^  (vv^+Hww  H'^-2Hwv  )u. 

T  -1 

Assume  <  ww  >  exists,  and  let 

T  T  — 

H  =  <  vv  >  <  ww  >  ^ 


(A.l) 


(A. 2) 


Take  the  expectation  of  both  sides  of  (A.l)  and  use  (A. 2). 
The  result  is  the  quadratic  form 

rn  rp  m  T  —  1  T 

0  £  u  (  <  vv  >  -  <  vw  ><  vw  >  <  wv  >)u. 

(A. 3) 


m 
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(A. 3)  shows  that  the  matrix  in  the  quadratic  form 
of  (A. 3)  is  nonnegative  definite,  since  u  is  an  arbitrary 
vector.  Thus 

m  m  m  1  m 

U.H) 


T  T  T  -1  T 

<  vv  >  >  <  vw  >  <  ww  >  <  wv  > 


If  f  is  a  scalar  function  of  the  vector  y,  let  the 
gradient  of  f  with  respect  to  y  be  written  as  a  row  vector 


_  /3f  3f  3f  V 

grad  f  -  gy^) 


(A. 5) 


If  g  is  a  column  vector,  let  grad  g  be  that  matrix 
in  which  the  i-th  row  is  the  gradient  (with  respect  to  y) 
of  the  i-th  component  of  g. 

grad  g^ 


grad  g  = 


Lgrad  g, 


nJ 


(A. 6) 


Consider  a  nonrandom  vector  estiiudcion  problem.  The 
parameter  vector  to  be  estimated,  y,  and  the  vector  of 
observations,  x,  are  the  arguments  of  the  probability 
density  function  p(x|y).  Let  .v(=  y(x))  be  an  estimate  for 
y  based  on  the  observation  vector  x.  With  the  gradients 
taken  with  respect  to  the  components  of  y. 
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This  Is  known  as  the  Cram^r-Rao  Matrix  Bound  (CRMB). 

The  matrix 

(PIM)  =  <  (grad  In  p(x|y) )'^(grad  In  p(x|y))  > 

(A. 12) 

is  called  the  Fisher  Information  Matrix  (PIM) . 

A  relevant  question  is  whether  it  is  possi'  to 
achieve  ev^uality  in  (A.  11).  This  is  possible  only  if 
there  is  equality  in  (A.l),  which  occurs  only  if  v  =  Kw. 

Note  that  H  as  defined  in  (A. 2)  is  not  random,  that  is,  it 
is  not  a  matrix  function  of  the  data.  H  is  at  most  a 
matrix  function  of  y.  Assume  exists.  Then  equality  in 
(A. 11)  is  possible  if  and  only  if 

grad  In  p(xly)  =  H”^(y) (y (x)-y)  .  (A. 13) 

That  is,  grad  In  p(x|y)  can  be  factored  into  a  matrix 
product,  whore  H”^(y)  is  a  matrix  that  does  not  depend  on 
the  data. 

The  development  above  is  patternea  after  Balakrishnan 

[12]. 

An  alternate  form  of  the  PIM  is  often  more  convenient 
to  use  than  the  right  side  of  (A, 13).  It  is  derived  as 
follows; 
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APPENDIX  B 

COMPLEX  GAUSSIAN  RANDOM  VARIABLES 
Let  X  and  y  be  real  Gaussian  random  n- vectors;,  with 
means  x  and  y,  respectively.  The  joint  probability  density 
function  for  x  and  y  is 

p(x,y)  =  (2u)""  (det  exp  ( 4[yly ]  ^’^[y-y] 

(B.l) 

where 


Let  tne  complex  random  vector  z  h  x  +  jy,  where  the 
real  and  imaginary  parts  are  distributed  according  to  (B.l). 
Under  certain  practical  and  important  conditions  the  Joint 
density  function  can  be  written  as  a  function  of  the  complex 
random  vector  z,  rather  than  as  a  function  of  both  x  and  y. 
This  is  desirable  in  that  expectations  of  functions  of  z, 
and  also  investigations  of  the  properties  of  the  likelihood 
function,  become  much  more  tractable. 

It  is  convenient  to  state  certain  matrix  results. 
Consider  matrices  of  the  form  C  =  V  +  jW, 

where  V  and  W  are  real  square  n  x  n  matrices.  Then  the 
following  statements  are  true: 
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1.  Matrices  of  the  form  R  and  C  are  isomorphic  under  matrix 
addition  and  matrix  multiplication. 

2.  R  is  symmetric  <=>  C  is  Hermitian  (C  =  C  ) 

<4^  V  =  and  W  =  -W^. 

3.  C“^  =  A  +  JB  exists  R"^  =  B  "a  exists. 

4.  R  is  orthogonal  (R*^  =  C  is  unitary  (C“^  = 

5.  R  is  symmetric  positive  definite  C  is  Hermitian 
positive  definite. 

6.  If  V  =  and  W  =  -W*^,  if  x  and  y  are  n  vectors,  and  if 
z  =  X  +  jy,  then 


and 


(x^  y^) 


(x^  y^) 


"v  -w' 

X 

Lw  vJ 

_y_ 

L«  Vj  ^yj 


T  » 

Z  CZ  , 


T«-l  * 
=  z-^C  -^z  , 


7.  If  V  = 


and  W  =  then 


det  R  =  (det  C)^  . 


(B.3) 


(B.4) 


(B.5) 


Statements  i  through  6  are  easy  to  verify.  Only 
statement  7  requires  some  justification.  In  Mathews  and 
Walker  [13,  p.  155]  it  is  shown  that  a  complex  matrix  C  • 
can  be  diagonalized  by  a  unitary  matrix  T  if  and  only  if 

C  and  C  ^  commute  (C  *’0  =  CC  ).  If  C  is  Hermitian  (C  =  C  ) 

*T 

then  C  and  C  obviously  commute.  Let  T  be  the  unitary 
transformation  that  diagonalizes  the  Hermitian  matrix  C. 

Then  T-^CT  =  0,  where  D  is  a  dj agonal  matrix.  Since  det  T  = 
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(det  det  C  =  det  D.  Let  T,  C,  and  D  be  the  matrices 

isomorphic  to  T,  Cj  and  D,  respectively.  Then  D  =  (T)'’^CT, 
so  that  det  D  =  det  C.  But  C  *  R,  and  D  =  q  ,  so  that 

det  D  =  (det  D)^.  Thus  det  R  =  (det  C)^. 

Thus,  11'  z  =  X  +  jy,  where  x  and  y  are  Gaussian  n- 
vectors  distributed  by  (B.l),  and  if 


P  =  - 
^  2 


V  -W 
W  V 


(B.5) 


v;hsre  Y  =  and  W  =  -W^,  then  using  (B.^)  and  (B.5)  the 
density  function  (B.l)  can  be  written  as 


p(z)  =  (det  C)"^  exp{-(z-z)'^*C“’^  (z-z)}  (^.6) 


where  C  «  v  +  jW  and  z  =  x  +  Jy.  The  complex  vector 
argument  of  (B.6)  is  called  a  complex  Gaussian  random 
vector. 

Complex  Gaussian  random  vectors  and  their  properties 
have  been  extensively  studied  by  Goodman  [4].  In  [i<] 
Goodman  develops,  among  other  things,  results  pertaining 
to  characteristic  functions,  maximum  likelihood  estimation 
of  C,  and  distribution  functions  for  the  maximum  likelihood 
estimates . 

Whether  or  not  the  complex  random  variables  are 
distributed  by  (B.6),  the  relation 
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<  ZnZ^Z^Z),  >  =  <  ZtZ^  >  <  ZoZ„  >  +  <ZtZ^  >  <  ZgZ^  > 


'1^2  3  ^ 


'1"2 


3^1, 


'1“3 


+  <  Z^Z|^  >  <  Z^Z^  > 


-  2  <  z^  >  <  Z2  >  <  z^  >  <  z^  > 


(B.7) 


holds  if  the  real  and  imaginary  parts  of  the  complex 
random  variables  in  (B.7)  are  Gaussian  random  variables. 
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